1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT
51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA
601 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 
1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA
1201 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
1301 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
1401 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 
1451 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA
1501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1601 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT
1651 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG
1701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA
2201 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG
2251 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
2551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
2651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
2951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
3051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD
51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLVVS
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A
1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG
51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1251 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA
1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG
CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA
GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCA 
YFLIHSTS IAALVVTRKDAK
TS*YIQQV*QLL* * Q E R M Q N 
LPNTFNKYSSSCSDKKGCKT 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG
GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC
HCNLARNRLHCLFQSCKNN
IEIWLEIAFIDYSKVAKTIR- 
LKSGSKSPSLTIPKLQKQLK- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA

241 + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT

SSTSRMQIFITILSCEN*F*
VLLLGCKSL*RFFLEKISFK-
FYFSDANLYNDSFLRKLVLK-

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 
301 «■ -* ♦ + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT

KAESKE*KLKHY*CLNKIR*
KRRAKSRNCNITNV*IKSGN-
SGEQRVEIETLLMFK*NQVM-
TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA
ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT
CGLFYFLDHFLRS IMEKIT*
E D Y S I F * ITS * G A L W R K L L N - 
RI ILFFRSLLKEHYG ENYLI- 

TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + + + 480

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

Y'KVNSLDYFPSQQCCVY-I 
TKR * T V W I IS LANNDEYI KF 
LKGKQFGLFP * PTMMSILNS- 
CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT

481 - * + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRISIHQTYQRQTRY

b ICECVKGSRYIRLTKDKLAI

c YENESKDLDTSDLPKTNSL*-

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG

541 + + + + + + 600

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC

a KTQEKVC*SNSRRTYCIYYS

b KRKKKFDNRTAEELIAFTIR-

c NARKSLIIEQQKNLLHLLFV-

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 * + + + + + 660 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRRCTPESCDN 

b MGFITIVLGIDGELPSLETI- 

c WVLLQLF * VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT

661 * - <- + 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA

a CKSCLQLKESQFCKF *CVCH 

b EKAVYNCRNRSSESSDVYAI- 

C KKLFTTEG IAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 ♦ + + --- + + + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT

a YFVN'SQISYLNLMDSYRNK 

b ILCINLKYLISI'WIAIETN- 

C FCELISNILSQFNG*L*KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 * + «. 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNG I YVKS FGTNAH 

b QINHASLMEYTLNPLGQMHT- 

c K'TX.QV'WNIR* ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 -* «■ * * 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG

a C IYIGFLKHRYTECFRDCFS 

b EFILDS'SrOTQNALETDLA- 

C N L Y a" I L K A ~ IHRML * RLI * L- 

TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 * + * 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT

a LQQ ITCrDYSCSSL I SLKEA 

b YNRLPVuITLAHLLYL*KKQ- 

C TTDYLFCLLLLI SY I FKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 * * * - * 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEISKFVDSSVT 

b AKCKED'RKRFQNLL I L L * P - 

C RN EKKTKERD FK ICC FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 - * * * + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQSCF 

b ELTTR tLATK KKKSYHNPDS 

C N * QQEY'QPKRRRA ITILIL- 
TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG

1081 + **• * + «- «■ 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIKIHIL* 

b *RFQKFQVREIHSLKFIYYS 

C KDFKNSR * ERY I H * N S Y I IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + +■ + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS *QYFLISWK- 

c FHFTAVI F FYLNN I FCLAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + «■ + + * + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA

a VKSIK*EKR*TEVT*LIHIH 

b *KVSNKRSARLR * LSLFTFI 

C KKYQIREALDCGNLAYSHS*- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIRC*GNSSHPF-K 
b ORPSYIQYDDKETAVI RFKN 

C IDLHISNTMIRKQQS SVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + * + - - 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a * C Y E D * I F R V K K W S R N L N Q K 

b SAMRTKFLESRNGAE I LtKK- 

c VLCGLNF* SQEMEPKS * SKR- 

GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 - + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL'IFR* * V L P 

b NCVDTAKESNSKSFVNKYYQ- 

C IASILQKNRTLNLSLISITN- 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 
1441 + + + + 1500

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILTDCRO* RGNCTEDH * R N K 

b SCLIEEIDEATAQKI IKEIK- 

c LOCLKRLTRQLHRRSLKK'5- 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 * ♦ + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN'RIN* ITNIEISDLQ 

b *LLLIRE*TKLLI 'RSAIFN- 

C NFY'LENKLNY * YR DQRSS I- 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 - 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK'XLN* S'TIKNTNLGQN 

b CRNXSCTKVRQ-KIOTLVKt- 

c DEISAELKLDNKKYKPWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + * 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCCTTATTTATTTTACT 

a IEEGXEDQLAKEK IRQ * I KC 

b LRKEKKTS * Q K K K * G N K * N E - 

C CGRKRRPVSKRKNKA I S K H S - 
GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT

1681 — * + + * + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQKCRNKRF I FFNNLLKRGV 

b YRSEEIKDLFFSI iyckegf- 

C TEVKK* KIYFFQ* FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 

b WGFGVLG- 

c G V L G F W 



FIGURE 13 
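
The three-frame translations shown in the figure above (rows a, b and c under each 60-nucleotide block, with the complementary strand printed beneath the top strand) can be reproduced with a short script. The following is a minimal sketch, not the method used to prepare the figure. The function names are hypothetical, and the codon table assumes the euplotid nuclear genetic code (NCBI translation table 10), in which TGA encodes cysteine rather than a stop; that assumption is consistent with the translations printed in the figure.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order.
# Assumption: euplotid nuclear code (NCBI table 10), so TGA -> C, not stop.
EUPLOTID_AA = (
    "FFLLSSSSYY**CCCW"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), EUPLOTID_AA)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(dna):
    """Base-by-base complement (not reversed), as printed under each block."""
    return dna.upper().translate(COMPLEMENT)


def translate(dna, offset=0):
    """Translate complete codons starting at `offset`; unknown codons give X."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(offset, len(dna) - 2, 3)
    )


def three_frames(dna):
    """Return the translations for frames a, b and c (offsets 0, 1, 2)."""
    return [translate(dna, offset) for offset in range(3)]


if __name__ == "__main__":
    block = "TTGGGGTTTTGGGGTTTTGGGG"  # final block of the figure (1741-1762)
    print(complement(block))
    for label, peptide in zip("abc", three_frames(block)):
        print(label, peptide)
```

Unlike the figure, this sketch translates only the complete codons inside a block; the figure also carries a codon that straddles a block boundary into the trailing residue of rows b and c.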



2 EVDVDNQADNHGIHSALKTCEEIKEAKTLYSWIQKVIRCRNQSQSHYKDL 51 
|:::. | :.::|: :| |.::| - ::. . | |..|.|.| 

19 ELELBKQENQNDIQVRVK. . . . IDDPKQY. , LVNVTAACLLQEGSYYQDK 62 

52 EDIKIFAQTNIVATPRDYNEEDFKVIARKEVF . STGLMIELIDKCLVELL 100 

:: : .|.| ..|.. -1= • • I I -..::|.: 

63 DERRYIITKALL EVA ES DP EF I CQLAVY I RNELY I RTTTNY I VAF . 107 

101 SSSDVSDRQKLOCFGFQLKGNQLAKTHLLTALSTQKQYFFQDEWNQVRAM 150 

108 . . . . . CWHKNTOPFIEKYFNKAVLLPNDLLEVCEFAQVLYI 144 

151 I GN E L FRH L YTK Y L I FQ RT S EGT LVQ F CGNNVFDH LKVND K FDKKQ KGG A 2 00 

145 FDATF.FKNLY LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE PRCCSTCKYNVKNEKDHFLNNINVPNWNNMKSRTRIFYCTHF 247 

.:.|| .::|..: . | . . . : : | . . | . . | : . | . : 

182 SEFNEYQLGKYCTES. .QRKKTMFRYLSVTNKQKWDOTKKK 220 

248 NRNNQFFKKHEFVSNKNNISAMDRAQTIFTNIFRFNRIRKKLKDKVIEKI 297 

|.: -:.| : | .: : :|:..: Mi- I- II 

221 . R KENLLTKLQAI KESEDKSKRETG DIMNVEDAIKALKPAVMKKI 264 

2 98 AYMLEKVKDFNFNYYLTKSCPLPENWRERKQKIENLINKTREEKSKYYEE 347 

1-1 :. I ..: 1-1 U :-: 
FIGURE 19



1 aactcattta attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
241 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttta caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca aaggacacac
1621 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggttcc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1741 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1801 ttacttagaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
2041 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
2221 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc
2401 aaagaacaaa aaagattaaa a 
1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1021 agatagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1741 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1861 gtatactttt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc
1921 ttcctcatta gaaagcttag aagatattga tagtctttgc aaatctattg cttcttgtaa 
1981 aaatttacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 
Motif 0

AKFLHWLMS V Y WELI»RS FF YVTETTFQKHR 
ISE I EWLVLGKRS KAKMCLSDFEKRXQIF A£F I YVTL YNSFI I P I LQSFFY ITESS DL.RNR 
LKDFRWLFISD---IWFTKHKFENLNQLAlCFISWLFRQLIPKIIfiXFFVCTEl6STVt- 
TREISKMQVET-SAKHFyyFDHEN-IYVl,WKLLRWXFEDLWSLIRCFFyVTEQQKSySK 

. * *** ** 

Motif 1 

LFFYRKSVWSKLQSIGIRQHLKRVQLRDVSEAEVRQHREARPALLTSRLRPIPKP—DOL 
TV YFRKD I WKLLCRPF I -TShKMEAFEX I NEHNVRHDTQK-TTjLPPAV I RLLPKK«HTF 
IVYFRKDTWNKt.ITPFlVEVFXTyi*VEKMVCRNHNSyTLfl — NFNHSKMRl IPKK5HNEF 
TyyYRKHlWDVIKKMSI-ADLKKETLAEVQEKEVEEWKKS-LOFAPGKLRLIPKK~TTF 
* . * * .* •** 

Motif 2

RPlVNMDyWGARTFRREKRAERLTSRVKALP-SVLHYERA 

RL* TN- LRKRFLI KHOS HK RMLVS TN QTLRP VAS ILKHL I NEES SGI PFHLE VYMKLiLtTF 

RIlArPCRGADEEEFTIYKEHKKKAIQPTQKILEyLRNKRPTSFTKiySPTQIADRIKEF 

RPIMTFHKKIVNSDRKTTKLTXKXKLLRSHLM1-KTLKM-RMFKDPFGFAVFHYDDVMKKY 
» * * * 

• « 

Motif 3 (A)

KKDLLKHRKFGR-KXyFVRIDIKSCYDRIKQDLMFRIVKK-KLKDPEPVIRKYATIHATS 
KQRLLKKFNNVLPELYFMKFDVK8CYDSIPRHECMRILKD-ALKNENGFFVRSQYFFHTM 
EBFVCKWKQVGQPKLFFATMDIEKCYDSVKREKLSTFLKTTKLLSSDrWIMTAQILKRKM 
ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 
GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 
MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKfWVFEETVSKKRK 

RTIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWLVL 

GKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLPJ^TVYFRKDIWKLLCRPFIT 

SMKMEAFEKINENhTVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKXMLV 

LRPVASILKHLn^EESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKKDAKKFLNLSLRGFEKFI>^ 

FSWMRSLDTLLACPKIDEALFT^STSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 

YRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 
ggtaccgamacmccmcttcataagctaattgcttcctcgaacgctcct^
tgttattagtgatcgaiaatamctattttatcggtcgftacc^ 
ggttcgcttactmaatcgtggtactg^agctgctacttctag^ 
ctgatgagactatattagattcattacagtccgtgcatattcttaa^ 
aaaaggttgataattamgcaaaatcatgtccttagfggtggtaatcc^^ 
ctcctttaacgcggttttattmctatmctattctcatg^gttccaaatatgtatcatctcgtattaggcttt^ 
> aataatctaaattagmcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatg^^ 
aaaatcctataattataaatattaatcaatemgcggtcactatttatttaaaacgttatgatcagtaggacacmgc 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 
CCTTAAATGATTATGTACAACTT&TTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 
CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTATTTTTCTTCATTCGACTGTAGTC 
AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATC 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 
ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattctaattgtgaaatamacctgcaattactgmca 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggttaagatgttgcgca 

agactgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTT^ 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTC 

aactaatactgttatccttcataactaatmagATCTATATTTTAACTTACACTCTAm 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

tattgtataaaamattaccactaacgattttaccagACCTCGAAACTTTCTTGAAATTATCG 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatttttttaccattaattaacaatcagATT^ 

CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 
AATTCATCTACTGGCTATACAATTCGTTTATAATAC^ 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGC 
TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtatmaaagtattttttgcaaaaagctaatattttcagAACAA 
TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 
CTTTCGTCTCATTACGAATTTAAGAAAAAGATTC 

cagATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTA 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTC 

TATTTTGTACGGATAGATATAAAATCCT^ 

AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 

AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagmatttmcattggaatttttt 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTT^ 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 

TAAGgtataccaattgttgaattgtaataacactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCC 

AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGA 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 
AGGATGCAAAAAAATTTTTGAATTTATCTTTA^ 

ACACAATTTTTCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 
ATACTTTTTTTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAAC 
ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 
ATATGGGGAAATCTTTTTTTTACAAAATTCTAAGgtatactgtgteactgaataatagctga 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 
GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 
AAAGAATGTTCATAACGGgtgagtacttatmaactagaaaag 

ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 
ATGgtacgtgtcggtctcgagacttcagcaatattgacacatcagGCTTTTTTGTCTTGGAATGAGAGATC

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTC 

tattactggtgtcttaaacaatattattactaagtatagctgacccccaaagca^ 

atccttatactmaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaacattaaaagtaatacatgaggctaatctcctttcattta 

aataaggaaagtggttttctataatgaataatgcccgcactaatgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatat 

acccagtgttgttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagccccaggttatccatggtggccg 

gccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcatttaatgtctta^ 

gtgttaagccattattggattccgaaatagccaaamcttggttcctcaaagcggaagto^ 

gaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaatttmgcaaaaaagaaaatatcattgggagacat^ 
gagtatctccagcggatccttgatgtcaataacttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 
EST2 pep
Euplotes pep 

Trans of tetrahymen 
Consensus 



FFYCTEISST VtlVYFRHDT KLIT F FIVE YFK-TYLVEN 

FFYVTEQQKS YSKTYYYRKN IWDVI-MKMS IAD LKK ETLA--EVQE 

KKKE GSQIFYYRKP IWKLVSKLTI VKVRIQFSEK NKQMKNNFYQ 

FFY.TE..K. J..YYYKK. IW...-KL F. .K V. , 



40 
43 
44 

50 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



NVCRKHNSY 

KEVE&WKKSL 
K. - .E -



GFAPG K ft? 
.F. .GKfc



■i 



S — j^EIiOTLRK 



lEBiEUKTF.RK 



79 
78 
92 

100 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



ADEEBFTIYK ENKKNAIQ P^QKILE^kt^K 



L. 



RPTSFTKIVS PTQIADRIKE 

RMFK -DPFGFAVFN 

ML-G -QKIGYSVFD 

ilks- .ig. *yf 1 i 



129 
120 
130 

150 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FKQRLLFKFN 
YD-DVMF | IE 
NK-QISEKF, 

% 



NVL 

EFVCKWKQVG q. 
'A QFIEKWXNKG 

.F. .KWK. *G 



D VKSCYD 
FATMD XEKCYD 
xYVTL- 

A 

E&F.T.D ...CYD 
S-1: FFY VTE TTF QKN RLF FYR KSV WSK
S-2: RQHLKRVQLRDVSEAEVRQHREA
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE

A-1: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV
Poly 4

t t c
ta a g c c tcg
5'- cag acc aaa gga att cca taa gg -3'
QTKGIPQG

Motif 4 (B')

Motif 5 (C)

D D Y L L I T

3'- ctg ctg atg gag gag tag tgg -5'
a a aaaaa a a
t t t t
c c

Poly 1
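The Poly 1 primer is drawn 3' to 5', so the peptide printed above it (D D Y L L I T) is read from the complementary strand. The relationship can be checked with a minimal Python sketch. It assumes the standard genetic code and uses only the core primer sequence from the figure, ignoring the degenerate alternatives; the helper names `complement` and `translate` are illustrative and not part of the source.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the organisms in these figures may use variant codes for stop codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def complement(seq):
    """Base-by-base complement (no reversal); whitespace is dropped."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[b] for b in seq.upper() if not b.isspace())

def translate(seq):
    """Translate a 5'->3' coding sequence codon by codon."""
    seq = "".join(seq.upper().split())
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# Core Poly 1 sequence exactly as drawn in the figure (3' -> 5').
poly1_core = "ctg ctg atg gag gag tag tgg"

# Complementing a strand written 3'->5' yields the partner strand 5'->3'.
coding_strand = complement(poly1_core)
print(coding_strand)             # GACGACTACCTCCTCATCACC
print(translate(coding_strand))  # DDYLLIT
```

Because the primer is already written in the 3' to 5' direction, no reversal step is needed before translating.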
PCR Product M2 Showed Reasonable Match
with Other Telomerase Proteins

Ot LCVSYILSSFYYANLEENALQFIiRKESMDPEKPETNLLMRLT 

Ea_p123 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW

Sc_p103 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS



QKVGIPQG

caa aaa gtt ggt atc cct cag gg < Actual Genomic Sequence

t t c
ta a g c c tcg
cag acc aaa gga att cca taa gg >

cag acc aaa gga att cca tca ggC TCA ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG
gtc tgg ttt cct taa ggt agt ccG AGT TAA GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC

QTKGIPSGSILSSFLCHFYM



GAA GAT TTG ATT GAT GAA TAC CTA TCG TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA
CTT CTA AAC TAA CTA CTT ATG GAT AGC AAA TGC TTT TTC TTT CCT AGT CAC AAC AAT GCT 



ED LIDEYLSFTKKKGSVLLR 



GTA GTC gac gac tac ctc ctc atc acc
CAT CAG ctg ctg atg gag gag tag tgg 

VVDDYLLIT 

< ctg ctg atg gag gag tag tgg 

a a aaaaa a a 
t t t t 
c c 

Poly 1

gac gat ttc ctc ttt ata aca < Actual Genomic Sequence

D D F L F I T
3' RT PCR Strategy

1. Synthesis of cDNA with QT Primer.

mRNA 5' ... AAAAAAAAAAAAAAAA 3'
            TTTTTTTTTTTTTTTT (QT primer)

2. First Round PCR Using Outside Primer and QO Primer.

3. Second Round PCR Using Inside Primer and QI Primer.

4. Sequence Second Round PCR Products Using Inside Primer or QI Primer.
-Size Selected Libraries from P. Nurse
3-4 kb
5-6 kb
7-8 kb
11-12 kb

-Libraries from J.A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 

REP81ES Library (old)

REP81ES Library (new)
REP41ES Library

[Gel image: lanes labeled "Original PCR" and "3' RT-PCR"; 500 bp marker; Mg concentration series]
5' RT PCR Strategy

[Diagram labels: mRNA, 5' to 3', with poly(A) tail; oligo marked ddA- and -P]

1. Synthesis of cDNA with Specific Downstream Primer.

2. Ligate Oligo with 5'-P and blocked 3' to cDNA using T4 RNA Ligase.

4. Second Round PCR
Alignment of RT Domains from Telomerase Catalytic Subunits.

Motif 0

S.p. Tez1p (429). WLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIW ...(35)...
S.c. Est2p (366). WLFRQLIPKIIQTFFYCTEISSTVT-IVYFRHDTW ...(35)...

E.a. p123 (441). WIFEDLVVSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)...

* *** ** * * *

Motif 1 Motif 2 K
p hh h K hR h R
S.p. Tez1p AVIRLLPKK--NTFRLITN-LRKRF ...(61)...
S.c. Est2p SKMRIIPKKSNNEFRIIAIPCRGAD ...(62)...

E.a. p123 GKLRLIPKK--TTFRPIMTFNKKIV ...(61)...

* ** * * * *

Motif 3 (A) AF

h hDh GY h
S.p. Tez1p KKYFVRIDIKSCYDRIKQDLMFRIVK ...(89)...
S.c. Est2p ELYFMKFDVKSCYDSIPRMECMRILK ...(75)...

E.a. p123 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)...

* * *** *

Motif 4 (B')

hPQG pP hh h

S.p. Tez1p YLQKVGIPQGSILSSFLCHFYMEDLIDEYLSF ...(6)...

S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF ...(8)...

E.a. p123 YKQTKGIPQGLCVSSILSSFYYATLEESSLGF ...(14)...
* * ** * * *

Y Motif 5 (C) Motif 6 (D)

h F DDhhh Gh h cK h

S.p. Tez1p VLLRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENS ...(205)
S.c. Est2p LILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKAI^RDKILAVSSQS ...(173)
E.a. p123 LLMRLTDDYLLITTQENNAVLFIEKLINVSRENGFKFNMKKLQTSFPLS ...(209)
FIGURE 43

Disruption strategy for the putative telomerase genes.

1. Transform with linear fragment
containing the telomerase gene
disrupted with a LEU2 or ura4 marker.

2. Assay in selective media.

3. Sporulate, and grow on
selective media.

(These cells will show a senescence phenotype
if the disrupted gene encodes a telomerase subunit.)

FIGURE 44

An Example of Confirmation of tez1 Disruption by PCR

tez1 disruption causes progressive
shortening of telomeres in S. pombe

FIGURE 46



1 ggtaccgatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 
161 ccaagtataaggacaaaaagaacaacttccttccccctaaagacttttactttattaatttacttttcaaatatatttcg 240 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 
321 agctcttggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 
401 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
481 gttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 
561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 
641 ccaaatatgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 
721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 
801 gatactttgcaaaacatttattagctatcattatataaaaaaaatCCtataattataaatattaatcaatatttgcggtc 880 
881 actatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018
1 MTEHHTPKSRILRFLENQYV 20

1019 TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078
21 YLCTLNDYVQLVLRGSPASS 40

1079 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138
41 YSNICERLRSDVQTSFSIFL 60

1139 CAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 1198
61 HSTVVGFDSKPDEGVQFSSP 80



FIGURE 46 (cont) 



1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgatttttttctattcgggatagctaatatatgggcag 1272 

81 K C S Q S E 86 

1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 

87 L I ANVVKQMFDE S FERRRN L 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatatttacctgcaattactgtttcaaagaga 1405 

107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 

114 NHEDFRAMHVNGVQN 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 

129 DLVSTFPNYLISI LESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 

149 L L L E I I G 155 

1602 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661

156 SDAMHYLLSKGSIFEALPND 175

1662 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721

176 NYLQISGIPLFKNNVFEETV 195

1722 TCA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781

196 SKKRKRTIETSITQNKSARK 215

1782 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841

216 EVSWNSISISRFSIFYRSSY 235

1842 AAG AAG TTT AAG CAA G gtaactaatactgttatccttcataactaattttag AT CTA TAT TTT AAC 1907

236 KKFKQD LYFN 245

1908 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

246 LHS ICDRNTVHMWLQWI F PR 265 



1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027
266 QFGLINAFQVKQLHKVIPLV 285

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087
286 SQSTVVPKRLLKVYPLIEQT 305

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147
306 AKRLHRISLSKVYNHYCPYI 325

2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207
326 DTHDDEKILSYSLKPNQVFA 345

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267
346 FLRSILVRVFPKLIWGNQRI 365



2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgattttaccag AC CTC GAA ACT 2336 
366 FEIILKD LET 375 



FIGURE 46 (cont.) 



2337 TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2396 

376 FLKLSRYESFSLHYLMSNIK 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2465 

396 ISEIEWLVLG 405 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525

406 KRSNAKMCLSDFEKRKQIFA 425

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSFIIPILQSFFY 445 

2586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2645 

446 I TESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA ATA AAC GAG 2705

466 LLCRPFITSMKMEAFEKINE 485 

2706 gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775

486 NNVRMDTQKT 495

2776 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2835

496 TLPPAVIRLLPKKNTFRLIT 515

2836 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttacttctaatctatta 2906

516 N L R K R F L I K 524

2907 ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967

525 MGSNKKMLVSTNQTLRPV 542

2968 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027

543 ASILKHLINEESSGIPFNLE 562

3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

563 VYMKLLTFKKDLLKHRMFG 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 RKKYFVRIDI 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 KSCYDR IKQDLMFRI VKKKL 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3275 

612 KDPEFVIRKYATIHATSDRA 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 

632 TKNFVSEAFSYF 643 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

644 DMVPFEKVVQLLSMKT 659 

3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 

660 SDTLFVDFVDYWTKSSSEI F 679 



3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaattgttgaattgtaataaca 3532 
680 KMLKEHLSGHIVK 692 
3533 ctaatgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593
693 IGNSQYLQKVGIPQGS 708

3594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3653 
709 ILSSFLCHFYMEDLIDEYLS 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713
729 FTKKKGSVLLRVVDDFLFIT 748

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagttgctgtcattcc 3777 
749 VNKKDAKKFLNLSLRG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 
765 FEKHNFSTSLEKTV 778 

3841 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3900 
779 INFENSNGI INNTFFNESKK 798 

3901 AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT GAT ACA TTG TTA GCA TGT 3960
799 RMPFFGFSVNMRSLDTLLAC 818

3961 CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4020
819 PKIDEALFNSTSVELTKHMG 838

4021 AAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4089
839 KSFFYKILR S 848

4090 AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149
849 SLASFAQVFIDITHNSKFNS 868

4150 TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4209
869 CCNIYRLGYSMCMRAQAYLK 888

4210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gtgagtacttattttaactaga 4274
889 RMKDIFI PQRMFITD 903 

4275 aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 4339 
904 LLNVIGRKIWKKLA 917 

4340 GAA ATA TTA GGA TAT ACG AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401
918 EILGYTSRRFLSSAEVKW 935 

4402 ggtctcgagacttcagcaatattgacacatcag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 
936 LFCLGMRDGLK 946 

4469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4528 
947 PSFKYHPCFEQLIYQFQSLT 966 

4529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4588 
967 DLIKPLRPVLRQVLFLHRRI 986 



4589 GCT GAT TAA tgtcattttcaatttattatatacatcctttattactggtgtcttaaacaatattattactaagtata 4665 
987 AD* 989 
4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct 4745
4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccattaaacgggagtggttaaaca 4825 
4826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4985
4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 
5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 
5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 5225 
5226 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcc 5305 
5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 
5386 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 5465
5466 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 
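The Figure 46 listing prints exons in uppercase and introns in lowercase, so a codon can be split across an intron (for example, `AT ... T` at nucleotides 1530-1601). The following is a minimal Python sketch of how such a case-annotated segment can be spliced and translated. It assumes the standard genetic code; the function names `splice` and `translate` are illustrative and not from the source.

```python
from itertools import product

# Standard genetic code in TCAG order (assumed for this sketch).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def splice(annotated):
    """Keep uppercase (exon) bases; drop lowercase (intron) bases and spaces."""
    return "".join(ch for ch in annotated if ch.isupper())

def translate(cds):
    """Translate an in-frame coding sequence; trailing partial codons are ignored."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# Nucleotides 1530-1601 as printed in Figure 46.
segment = ("CTT TTG TTA GAA AT "
           "gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag "
           "T ATC GGC")

mrna = splice(segment)
print(mrna)             # CTTTTGTTAGAAATTATCGGC
print(translate(mrna))  # LLLEIIG
```

The result matches residues 149-155 (L L L E I I G) printed under that segment, with the split codon `AT` + `T` rejoined as `ATT` (I).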



1

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gln lys asn arg
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gln ser ile
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gln his leu lys arg val gln leu arg glu leu ser
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60

glu ala glu val arg gln his arg glu ala arg pro ala leu leu
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gln asp pro pro pro glu leu tyr
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gln
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gln asn
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
190

thr tyr cys val arg 
ACG TAC TGC GTG CGT 



gly thr ser ala arg 
GGC ACG TCC GCA AGG 

220 

gln gly ile pro gln
CAG GGG ATC CCG CAG 



leu cys tyr gly asp 
CTG TGC TAC GGC GAC 

250 

arg asp gly leu leu 
CGG GAC GGG CTG CTC 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

280 

arg gly val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 
GTG GTG AAC TTC CCT 

310 

phe val gln met pro
TTT GTT CAG ATG CCG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC TCC 



phe lys ala gly arg 
TTC AAG GCT GGG AGG 

370 

arg leu lys cys his 
CGG CTG AAG TGT CAC 



arg tyr ala val val 
CGG TAT GCC GTG GTC 

210 

pro ser arg ala thr 
CCT TCA AGA GCC ACG 



gly ser ile leu ser 
GGC TCC ATC CTC TCC 

240 

met glu asn lys leu 
ATG GAG AAC AAG CTG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 
GTA GAA GAC GAG GCC 



ala his gly leu phe 
GCC CAC GGC CTA TTC 

330 

thr leu glu val gln
ACC CTG GAG GTG CAG 



ile arg ala ser leu 
ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 
AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 



200 

gln lys ala ala met
CAG AAG GCC GCC ATG 



ser tyr val gln cys
TCC TAC GTC CAG TGC 

230 

thr leu leu cys ser 
ACG CTG CTC TGC AGC 



phe ala gly ile arg 
TTT GCG GGG ATT CGG

260 

asp phe leu leu val 
GAT TTC TTG TTG GTG 



leu arg thr leu val 
CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 
AAC TTG CGG AAG ACA 



leu gly gly thr ala 
CTG GGT GGC ACG GCT 

320 

pro trp cys gly leu 
CCC TGG TGC GGC CTG 



ser asp tyr ser ser 
AGC GAC TAC TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 
CTC TTT GGG GTC TTG 

380 

leu gln val asn ser
TTG CAG GTG AAC AGC 
390

leu gln thr val cys thr asn ile tyr lys ile leu leu leu gln
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gln leu pro phe his gln

GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

gln val trp lys asn pro his phe ser cys ala ser ser leu thr
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gln glu arg arg asp

CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT

450 

val ala gly gly gln gly arg arg arg pro ser ala leu arg gly
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gln ala asp ser

CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gln asp ser
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gln pro gly thr ala leu arg leu gln asp
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gln pro gly arg glu gln

CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gln gly gly
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCAGCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
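The listing above gives the protein in three-letter abbreviations, whereas the alignments and peptide lists elsewhere in these figures use one-letter codes. A small conversion sketch makes the two easier to compare. It is illustrative only: the mapping is the standard amino-acid abbreviation table, and the function name `three_to_one` is an assumption, not from the source.

```python
# Standard three-letter to one-letter amino-acid abbreviations.
THREE_TO_ONE = {
    "ala": "A", "arg": "R", "asn": "N", "asp": "D", "cys": "C",
    "gln": "Q", "glu": "E", "gly": "G", "his": "H", "ile": "I",
    "leu": "L", "lys": "K", "met": "M", "phe": "F", "pro": "P",
    "ser": "S", "thr": "T", "trp": "W", "tyr": "Y", "val": "V",
}

def three_to_one(line):
    """Convert a whitespace-separated three-letter peptide line to one-letter code."""
    return "".join(THREE_TO_ONE[token] for token in line.lower().split())

# Two rows copied from the listing above.
print(three_to_one("met ser val tyr val val glu leu leu"))
# MSVYVVELL
print(three_to_one("ile val asn met asp tyr val val gly ala arg thr phe arg arg"))
# IVNMDYVVGARTFRR
```

An unrecognized token raises `KeyError`, which is a convenient way to spot OCR damage in an abbreviation.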
Motif -1

Ep p123 ...LVVSLIRCFFYVTEQQKSYSKT...

Sp Tez1 ...FIIPILQSFFYITESSDLRNRT...

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV...

Hs TCP1 ...YVVELLRSFFYVTETTFQKNRL...
consensus FFY TE

K

Motif 0 phhh K hRh R

Ep p123 ...KSLGFAPGKLRLIPKKT--TFRPIMTFNKKTV...

Sp Tez1 ...QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL...

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD...

Hs TCP1 ...ARPALLTSRLRFIPKPD-GLRPIVNMDYVVG...
consensus R PK R I

AF

Motif A h hDh GY h

Ep p123 ...PKLFFATMDIEKCYDSVNREKLSTFLK...

Sp Tez1 ...RKKYFVRIDIKSCYDRIKQDLMFRIVK...

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK...

Hs TCP1 ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//...
consensus F D YD

Motif B hPQG pShh

Ep p123 ...NGKFYKQTKGIPQGLCVSSILSSFYYA...

Sp Tez1 ...GNSQYLQKVGIPQGSILSSFLCHFYME...

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD...

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG...
consensus G QG S

Y

Motif C hFDDhhh

Ep p123 ...PNVNLLMRLTDDYLLITTQENN...

Sp Tez1 ...KKGSVLLRVVDDFLFITVNKKD...

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ...

Hs TCP1 ...RRDGLLLRLVDDFLLVTPHLTH...
consensus DD L

Motif D GhhcK

Ep p123 ...NVSRENGFKFNMKKL...

Sp Tez1 ...LNLSLRGFEKHNFST...

Sc Est2 ...KKLAMGGFQKYNAKA...

Hs TCP1 ...LRTLVRGVPEYGCVV...
consensus G
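The consensus rows above list only the positions where all four sequences carry the same residue. As a rough illustration of that rule, the following Python sketch computes a strict consensus for the Motif C rows. It assumes the four fragments are already aligned column by column, as printed; the function name `strict_consensus` and the use of `.` for non-conserved columns are choices made for this example, not taken from the source.

```python
def strict_consensus(seqs, filler="."):
    """Return the residue at columns where every sequence agrees, else `filler`."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to equal length")
    return "".join(
        column[0] if len(set(column)) == 1 else filler
        for column in zip(*seqs)
    )

# Motif C fragments as printed (flanking ellipses removed).
motif_c = [
    "PNVNLLMRLTDDYLLITTQENN",  # Ep p123
    "KKGSVLLRVVDDFLFITVNKKD",  # Sp Tez1
    "SQDTLILKLADDFLIISTDQQQ",  # Sc Est2
    "RRDGLLLRLVDDFLLVTPHLTH",  # Hs TCP1
]

print(strict_consensus(motif_c))
# ..........DD.L........
```

The three conserved columns reproduce the `DD L` consensus printed under Motif C.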



